|
|
Registro Completo |
Biblioteca(s): |
Embrapa Gado de Leite. |
Data corrente: |
26/12/2018 |
Data da última atualização: |
24/01/2023 |
Tipo da produção científica: |
Artigo em Periódico Indexado |
Autoria: |
ALMEIDA, A. L.; SCHETTINO, V. J.; BARBOSA, T. J. R.; FREITAS, P. F.; GUIMARÃES, P. G. S.; ARBEX, W. A. |
Afiliação: |
WAGNER ANTONIO ARBEX, CNPGL. |
Título: |
Relative scalability of NoSQL databases for genotype data manipulation. |
Ano de publicação: |
2018 |
Fonte/Imprenta: |
Revista de Informática Teórica e Aplicada, v. 25, n. 2, p. 93-100, 2018. |
DOI: |
10.22456/2175-2745.79334 |
Idioma: |
Inglês |
Conteúdo: |
Abstract Genotype data manipulation is one of the greatest challenges in bioinformatics and genomics mainly because of high dimensionality and unbalancing characteristics. These peculiarities explains why Relational Database Management Systems (RDBMSs), the "de facto" standard storage solution, have not been presented as the best tools for this kind of data. However, Big Data has been pushing the development of modern database systems that might be able to overcome RDBMSs deficiencies. In this context, we extended our previous works on the evaluation of relative performance among NoSQLs engines from different families, adapting the schema design in order to achieve better performance based on its conclusions, thus being able to store more SNP markers for each individual. Using Yahoo! Cloud Serving Benchmark (YCSB) benchmark framework, we assessed each database system over hypothetical SNP sequences. Results indicate that although Tarantool has the best overall throughput, MongoDB is less impacted by the increase of SNP markers per individual. |
Palavras-Chave: |
Data Science; Database; NoSQL; SNP. |
Thesaurus Nal: |
Bioinformatics; Genotype. |
Categoria do assunto: |
X Pesquisa, Tecnologia e Engenharia |
URL: |
https://ainfo.cnptia.embrapa.br/digital/bitstream/item/189325/1/Artigo-RevInfTeorApl-Arbex-Relative.pdf
|
Marc: |
LEADER 01813naa a2200265 a 4500 001 2102528 005 2023-01-24 008 2018 bl uuuu u00u1 u #d 024 7 $a10.22456/2175-2745.79334$2DOI 100 1 $aALMEIDA, A. L. 245 $aRelative scalability of NoSQL databases for genotype data manipulation.$h[electronic resource] 260 $c2018 520 $aAbstract Genotype data manipulation is one of the greatest challenges in bioinformatics and genomics mainly because of high dimensionality and unbalancing characteristics. These peculiarities explains why Relational Database Management Systems (RDBMSs), the "de facto" standard storage solution, have not been presented as the best tools for this kind of data. However, Big Data has been pushing the development of modern database systems that might be able to overcome RDBMSs deficiencies. In this context, we extended our previous works on the evaluation of relative performance among NoSQLs engines from different families, adapting the schema design in order to achieve better performance based on its conclusions, thus being able to store more SNP markers for each individual. Using Yahoo! Cloud Serving Benchmark (YCSB) benchmark framework, we assessed each database system over hypothetical SNP sequences. Results indicate that although Tarantool has the best overall throughput, MongoDB is less impacted by the increase of SNP markers per individual. 650 $aBioinformatics 650 $aGenotype 653 $aData Science 653 $aDatabase 653 $aNoSQL 653 $aSNP 700 1 $aSCHETTINO, V. J. 700 1 $aBARBOSA, T. J. R. 700 1 $aFREITAS, P. F. 700 1 $aGUIMARÃES, P. G. S. 700 1 $aARBEX, W. A. 773 $tRevista de Informática Teórica e Aplicada$gv. 25, n. 2, p. 93-100, 2018.
Download
Esconder MarcMostrar Marc Completo |
Registro original: |
Embrapa Gado de Leite (CNPGL) |
|
Biblioteca |
ID |
Origem |
Tipo/Formato |
Classificação |
Cutter |
Registro |
Volume |
Status |
URL |
Voltar
|
|
Registros recuperados : 2 | |
2. | | ALMEIDA, A. L.; SCHETTINO, V. J.; BARBOSA, T. J. R.; FREITAS, P. F.; GUIMARÃES, P. G. S.; ARBEX, W. A. Relative scalability of NoSQL databases for genotype data manipulation. Revista de Informática Teórica e Aplicada, v. 25, n. 2, p. 93-100, 2018.Tipo: Artigo em Periódico Indexado | Circulação/Nível: B - 3 |
Biblioteca(s): Embrapa Gado de Leite. |
| |
Registros recuperados : 2 | |
|
Nenhum registro encontrado para a expressão de busca informada. |
|
|